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533. 


ON THE BINOMIAL THEOREM, FACTORIALS, AND DERIVATIONS. 


[From the Ozford, Cambridge, and Dublin Messenger of Mathematics, vol. v. (1869), 
pp. 102—114.] 


THE following was part of my course of lectures in the year 1867. 


The proof commonly called “Euler’s” of the binomial theorem is as follows: the 
theorem is assumed to be true for positive integer indices; that is, it is assumed that 
for any positive integer m we have 

m (m—1) 


(1 +a)"=1+ma+— g & + &e. 


This being so, since (1+a)".(1+a)"=(1+2)"*", the equation 


fı + men p MED) apa wo {1 +e” artes w+.) 
gi gets eC) a a 


is true for any positive integer values whatever of the indices m, n; the equation is 
therefore true identically ; and it is consequently true for all values whatever of the indices 
m, n. But any function ¢m of m, satisfying the functional equation dm. dn =¢ (m+n), 
is an m power, = C™ suppose: that is, we have 
; -1 
Cm =] + me + vc ) + &e., 

1.2 
where C is a constant, viz. it is independent of m; but the value of C will of course 
depend upon s; and if, in orcer to determine it, we write m=1, the equation gives 
C=1+42; that is, we have 

-1 
(l+a)"=1+me+ mien dy + &c., 


which is the binomial theorem in its general form. 
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It is to be observed that there is not in the demonstration any employment o. 
the so called “principle of equivalent forms,” we do not from the truth of an equation 
for positive integer values of m, n, infer the truth of it for any values whatever of 
m, n; there is the intermediate step, that being true for integers, it is true identically ; 
and this identical truth of the equation depends on the circumstance that comparing 
on the two sides of the equation the coefficients of the successive powers 2, a, a, &c., 
these coefficients are in every case finite, rational, and integral functions of m, n. 


For instance, comparing the coefficients of a°, we have 
(m+n) (m+n—1)= m(m—1)+ 2mn+n(n—1), 


and any such equation, being true for all positive integer values of m, n, will be 
true identically ; developing the two sides, the equation is in fact 


m+ 2mn+n?—-mM—n= mM? —M+ Wn+ Nn? — VN. 


The reasoning is thus perfectly good; but I remark that it is quite as easy to prove 
the general equation of which the last mentioned equation is an example, as it is to 
prove the binomial theorem for positive integer indices; and consequently that we can 
without the aid of the binomial theorem for positive integer indices prove the fundamental 
equation 

n(n—1) 


1.2 æ +e} 


fı tmo RD etke) {1+ ne + 


a+ &e. 


_,,m+n | (m+n)(m+n—1) 
J en a eee fe e cai Tt, 


To show this I introduce the factorial notation and write 
m(m—1)...(m—r+T)=[m]; 


this being so, the equation obtained by comparing the coefficients of a” is readily found 
to be 


[m+ ny = [m] + [mya [np +S fn fn] + &e, 
and I say that this, the factorial binomial theorem for a positive integer index r, is 


proved as easily and in the same manner as the binomial theorem for a like value 
of the index; or say as the equation 


To show how this is, I form the values of [m +n}, [m+n], [m+n], &c. successively, 
by what may be termed the process of varied multiplication: we have 


[m+n]  =m+n; 
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to obtain [m+n]? we have to multiply this by m+n—1; in regard to the first term 
m, I write the multiplier under the form (m—1)+x, and in regard to the second 
term n, under the form m+(n—1); the process then stands 


[m+n =m +n 
m+n—l=(m—1l)+n| m+(n—-1) 


m(m—1) + mn 


+mn +n(n—1) 
[m+n =m(m—-1) +2mn +n(n—1) 
= [mF +2 [m] [n] + [nf. 


To form [m+n]; we have to multiply by m+n—2; in regard to the first term, this 
is written under the form (m—2)+7; in regard to the second term under the form 
(m—1)+(n—1); and in regard to the third term under the form m+(n—2); the 
product is thus obtained in the form 


[m] + [mF [n] 
+2 [mF [n] + 2 [m]! [nF 
+ [m] [n+ [nf 
= [m} +3 [m] [n] + 3 [m] [nf + [nF 


and so on; the law of the terms is obvious; and the numerical coefficients are in 
effect obtained as follows: 


viz. by precisely the same process as is used in finding the numerical coefficients in 
the powers (m+n), (m+n), (m +n}, &c.; and we thus see that for any integer value 
r of the index, we have a factorial binomial theorem, wherein the numerical coefficients 
are the same as in the binomial theorem for the same index. 

CL PV ILE. 59 
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But the method of varied multiplication may be applied to the demonstration of 
a much more general theorem; viz. we may use it to develope a product such as 


(h — a) (h — b) (h— c) (h — d) (h — e), 
according to a series of products 

(h — a) (h — B) (h — Y) (h — 8) (h — 6), 

(h — a) (h — B) (h — y) (h — è), 

(h—a)(h— B) (h— y), 

(h — a) (h — £), 

(h — a), 

A; 


For this purpose, starting with 
h-a=h-—a+a-—a, 


we multiply by h—6, written first under the form h— 8+ -—b, and then under the 
form h—a+(a—b); we have thus 


(h —a)(h — b)=(h— a) (h— 8) 
+(h—a) {(a—a) + (8 —b)} 
+1 (a—a)(a—b), 
and so on. It is easy to see that we may for instance write 
(h— a) (h—b) (hc) (h — d) (he) 
= (h — a) (h — B) (h — y) (h — 8) (h— e) 


+h- t-e- (HH HH A) 
+ (h — a) (h — B) (h — y) be 5 & 4 .) 


+(h —a)(h— 8) ee aas d 3 
3 » UY > 3 

+h-a (2 i 6, 4, N; 

+1 ligida d, ne 
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where the symbols ( ) denote sums of products of the differences of an upper letter 
and a lower letter, 0 factors in each product; and the several sums being formed as 
follows : 


a, B, ¥, 5, € ; 
For i sc i a write & : 
y c 
ò d 
e e; 


the expression is 


(a— a) + (B —b)+ (y — c) + (è - d) +(e—e). 


For (x = i A AY write a, p a, b 
a, a, 0 
a, y a, a 
B,B | 4 ¢ 
a, Ò a, e 
By | bd 
B, ò b, e 
yy | ¢d 
y, ò Cc, € 
ò, 8 d, @; 


viz. the expression is 


(a—a)(a—b)+ (a —a)(B—c)+...+(8—d)(6—e). 


For | “S A " ki write a, @, @ a, 6, ¢ 
a,a,B a, b,d 
a, a, y a, b, e 
a, B, B a, ¢, d 
a, B, Y a, C, € 
8B, By B b, c, d 
a, y, Y a, d, e 
B, B, Y 6, 6, e 
By Y b, d, e 
YY i c, d, e; 


viz. the expression is 


(a—a)(a—b)(a—c) +... + (y — ¢) (y — d) (y — e). 
59—2 
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For m 8 


write 4, a, a, a a, b, c, d 
a, b, C, d, ab 


a,a,a,B a, b, ¢, e 
a,a, BB a, b, d, e 
a, B, B, B a, c, d, e 
B, B.P. B | d,o, d, e; 
viz. the expression is 
(a— a) (a — b) (a — c) (a — d) ... + (B — b) (8 —c) (8 - d) (8 — e). 
Finally for 


: ) ite a,a,a | a, b,c d, e; 
(a ú eal a, a, Q, Q, » 0, C, Q, €; 


viz. the expression is 
(a — a) (a — b) (a — c) (a — d) (a — e), 
which explains the law of the formation of the several coefficients. It is to be observed 
that in forming the development of any symbol, for instance b a i z r á , the first 
~ ? 2 Vp W 3 

column contains the homogeneous products, 3 together, of a, 8, y; the second column 
the combinations (that is, combinations without repetitions) 3 together of a, b, c, d, e: 
the top line is a, a, a |a, b, c and to form the subsequent lines we must for any 
advance a into 8, &c. of a greek letter make the like advance a into b, b into c, or 
c into d, of the corresponding latin letter. 


Two particular cases of the theorem may be noticed: if the latin letters all vanish, 
we have, for example, 


hë = (h — a) (h — B) (h — y) (h — 8) (h — e). 


+(h—a)(h—B)(h—-y)(h-8) - .H,(@, B, y, 8 ©) 
+ (h — a) (h— B) (h — y) . H, (a, B, y, 8) 
+ (h — a) (h — £) . H, (a, B, y) 
+(h—a) .H,(a, B) 

+1 . H, (a), 


where the symbols H denote the sum of the homogeneous products of the annexed 
letters, taken together according to the suffix number: the last coefficient H, (a) is of 
course =a", And if the greek letters all vanish, then we have in like manner 
(h—a)(h—b)(h —c) (h—d)(h—e) =k 

— kC, (a, b, c, d, e) 

+C: (a, b, c, d, e) 

— kC; (a, b, c, d, e) 

+hC,(a, b, c, d, e) 

— O,(a, b, ¢, d, e), 
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where the symbols C denote the combinations of the annexed letters taken together 
according to the suffix number; the last coefficient C,(a, b, c, d, e) is of course 
=abede. This is the ordinary theorem giving the expression of an equation in terms of 
its roots. 


Combining the two theorems, if in the first theorem we express the products 
(h -- a) (h — B) (h — y) (h — è) (h — e), &c. in powers of h by means of the second theorem ; 
or if in the second theorem we express the powers h’, ht, &c. in terms of the products 
(h—a)(h—b)(h—c)(h—d)(h—e), &c. by means of the first theorem; then in either 
case we obtain certain identical relations connecting the C, H of (a, B,...) or of 
OIR 
I have mentioned the factorial notation 
[m] =m(m—1)...(m—r +1), 
where r is a positive integer; a consequence of this is 
[m] + = [m] [m —rf, 
where r and s are positive integers; or as this may also be written 
[m + r]" + = [m + rF [m]. 
Assuming this to subsist for s=0, or a negative integer; first for .s=0, we have 
[m+r]'=[m+r]'[m}]; that is, [m] is = 1; and then for s = — r, we have 1 =[m+r]" [m] " ; 
that is, 


om ier e 1 n 

[m] = m+rF , 
and in particular, r= 1, 2, &c., we have 
1 

a eee 

[m] T= m+ 1 , 
1 

-S a k 
[m] ~ (m+1)(m+ 2)’ 
&e., 


which explains the extension of the factorial notation to negative integer values of the 
index. 

But the equation 

[m] =[m]' [m —r} 

does not in any determinate manner lead to an extension of the factorial notation to 

phos II 
fractional or other values of the index. In fact, assuming [m]” “alt , where II 
is an arbitrary functional symbol, the equation in question becomes 


sede wien. tims oT Gae #) 
I(m—r—s) I(m—r) IW (m—r-—s)’ 
viz. the original equation is identically satisfied, without any condition whatever being 


imposed upon the function II, and on this account we have not, in the notation of the 
factorial with an integer index, any sufficient basis for a theory of general differentiation. 
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A product 
m(m— a)... {m—(r—1) a} 


can of course be expressed in the factorial notation, viz. it is 


and on this account it is not in general necessary to employ a notation such as 
[m, a]” to denote such a factorial wherein the difference of the successive factors instead 
of being =—1 is =—a; in particular cases where factorials of the kind in question 
are used, it may be convenient to employ such a notation. In particular it is some- 
times convenient to use the notation [m, —1]" or better {m}" to denote the product 


m(m+1)...(m+r—1), 


where the successive factors instead of being diminished, are increased by unity. It 
may be noticed, that reversing in this last product the order of the factors, we find 


{m} =[m+r—1]; 


a somewhat similar formula, but employing only the ordinary factorial notation, is 
obtained from the equation 
[m] =m(m—1)...(m—r+]), 


by first changing the sign of m and then reversing the order of the factors; viz. we have 
[— mf} =(—)" [m+r—-1f]. 


Reverting to the process used for the development of the expressions ( )s, where there 
are two columns, the one of greek, the other of latin letters; it is to be remarked 
that although the order in which the successive lines are evolved is not material for 
the purpose of the theorem, yet that a certain definite order of evolution has been 
a, B, y, 8 

a, b, c, d, e 
the homogeneous products of the second order in (a, 8, y, 5), was 


made use of; thus in regard to ( y the column of greek letters, giving 
2 
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this is evolved from the top term (a, a) by a process given implicitly in Arbogast’s 
Calculus of Derivations, and which may be termed the rule of the last and the last 
but one. Let the direction “operate on any letter,’ be understood to mean that the 
letter in question is to be changed into that which immediately follows it, but in 
such wise that when the letter occurs more than once, e.g. as in a, a the operation 
affects only the letter in the right-hand place. Then operate on the a, @ in regard 
to a, we obtain a, 8; operat: on this in regard to 8, we obtain a, y; and in regard 
to a, we obtain 8, 8. Again we operate on a, y in regard to y, and obtain a, ô; we 
do not operate on it in regard to a for the reason that a is not the letter immediately 
preceding y. Operate on £, 8 in regard to 8, we obtain B, y. The next step, if the 
series extended to e would be to operate on a, 8 in regard to 8, giving a, e; do not operate 
on it in regard to a, for the reason that a is not the letter immediately preceding 6. 
But in the example, since the series does not extend to e, there is no operation on 
a, 6. Passing then to the next term £, y, we operate in regard to y, obtaining £, 6, 
and since 8 is the letter immediately preceding y, we also operate in regard to £, 
obtaining y, y. Similarly if e were admissible, £, 5 would give £, e, but it in fact 
gives nothing; y, y gives y, ô; thus if e were admissible would give y, e and 6, ô, 
but it in fact gives only è, 8, and, e being inadmissible, the process is here concluded. 
The rule is, operate on the last letter, and when the last but one letter is that 
which, in alphabetical order, immediately precedes the last letter (but in this case 
only) operate on the last but one letter. 


Taking another example, but with numbers instead of letters, and supposing the 
highest admissible number to be 5, then from 111 we derive as follows: 


111 112 113 114 115 125 185 145 155 255 355 455 555 
122 123 124 184 144 235 245 345 445 
222 133 224 225 244 335 
223 233 234 334 344 
333, 


the original single column being here for greater convenience broken up into distinct 
columns; but the order of the terms, when the columns are taken one after the other 
in order, each being read from the top to the bottom, being the same as before; it 
will be noticed that the successive divisions are the partitions into 3 parts (no part 
exceeding 5) of the numbers 3, 4, ..., 15 respectively; the partitions being in each 
ease obtained without repetition, and those of the same number being given, say in 
their numerical order (corresponding with the alphabetical order when letters are 
employed). It is necessary to show that the partitions will be obtained without 
repetitions; and that all the partitions will be obtained; for this purpose consider, for 
example, the partitions of 9; any one of these is either a partition 135 where the last 
number 5 is not a repeated number; and in this case there is a partition of 8, viz. 
134, from which operating on the last we obtain 135; but there is no other partition 
of 8 which would give 135, the only such partition would be 125, but here, as 2 is 
not the number which immediately precedes 5, there is no operation on the last but 
one, and we do not from it obtain 135. Or else a partition of 9 is of the form 144 
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where the last letter is repeated; there exists in this case no partition of 8, such 
that operating on the last we obtain from it 144, but there does exist a partition 
of 8, viz. 184, such that operating on the last but one we obtain from it 144. That 
is, for any partition whatever of 9 there exists one (and only one) partition of 8, 
such that operating on the last or the last but one, we obtain from it the partition 
of 9; that is, taking the entire system of the partitions of 8, and operating on the 
last and the last but one, we obtain, and that without repetitions, the entire series 
of the partitions of 9; and so in general. 

Translating the example into letters, but using for greater convenience a, &c. 
instead of a, a, &c. the process will be precisely the same; taking the letters to be 
a, b, c, d, e, we have 

a’ @b ‘ac “@d ate “abe ace “ade ae be ce “de e 
ab? abc abd acd ad? bee bde cde de 
OF ae AT O30, et Da A OUO E one oat 
Bc. bc? bed. cd cd? 
sal 
Attributing weights to the several letters, viz. to a, b, c, d, e the weights 1, 2, 3, 4, 5 
respectively, the several columns show the terms of the weights 3, 4,...15 respectively. 

I have said that the foregoing rule is given implicitly in Arbogast’s Calculus of 

Derivations; this calculus includes in fact a process for the expansion of a function 


h (a + ba + ca? + dz? + &e.) 


in powers of æ; the expansion in question may be obtained by means of Taylor's 
theorem, viz. 


o (a + bx + cæ + da? + &c.) 
= pa 

p'a 
si > (bx + ca? + da? + &c.) 
+ _ (ba + ca? + da? + &c.)? 


+ (fo “a (bo + cat + da + ke.) 


viz. expanding the several powers of the polynomial increment, and arranging in powers 
of æ, this is 


= pa 
+a (a.b) 

/ n b? 
+a (pa.cto a. 

/ Mu wh b 
+æ(ga.d+o a. bc + o) a.g) 

/ “/ LA bc wy bt 
+ ai p'a.e+ $a. (bd+h4e°)+¢ a.z +o a. gg) 


+ &c. 
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but the object of the rule is to obtain this last-mentioned result directly, and under- 
standing “operate in regard to a letter” to mean differentiate with regard to this 
letter and integrate with respect to the next succeeding letter, then the coefficients 
of the successive powers of æ are all obtained from the first coefficient da, by operating 
thereon according to the rule of the last and the last but one; thus ġa, operating 
on a gives ¢’a.b; this operating in regard to b gives ¢’a.c, and in regard to œ gives 


2 

a. the term ¢’a.c is to be operated upon in regard to c only, and it gives 
2 

ga.d; the other term g'a.” operated on in`regard to b gives $”a.bc, and in regard 


3 
to a it gives ga. and so on. But attending only to the literal parts, the terms, 


for instance b?, bc, bd, &c., which present themselves in the formula, are the homogeneous 
Cc? 

terms derived from b, by the rule, as originally stated, with a view to the derivation 

of such terms. 


Cc. VIII. 60 
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